File system management

ABSTRACT

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for file system management. In one aspect, a method includes receiving a request at a kernel to perform a file operation on a particular file; identifying a flag bit associated with the file; invoking user space file handler to perform particular action when the flag bit is set; determining whether notification that action has been performed by file handler is received; and allowing the file operation on the file when notification is received.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation and claims the benefit under 35 U.S.C. §120 of the filing date of U.S. application Ser. No. 13/350,047, for File System Management, which was filed on Jan. 13, 2012, which claims the benefit under 35 U.S.C. §119(e) of the filing date of U.S. patent application Ser. No. 61/433,180, for File System Management, which was filed on Jan. 14, 2011, and which are incorporated here by reference.

BACKGROUND

This specification relates to file systems.

Requests for particular file operations, for example to read a file, are sent to the operating system kernel (e.g., using a system call). The kernel manages resources (e.g., system hardware as well as particular information such as a database record) available to perform the operations. The kernel allocates resources for performing the requested operation. Conventionally, resources can only be used by a single process at a time. When two processes are running, they can each be waiting for an event of the other to occur in order to proceed. For example, the event that each process is waiting for can be the release of a respective recourse currently possessed by the other process. However, since both processes are waiting for the other, neither will cause the event. This results in a deadlock.

SUMMARY

This specification describes technologies relating to file system management.

In general, one innovative aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving a request at a kernel to perform a file operation on a particular file; identifying a flag bit associated with the file; invoking user space file handler to perform particular action when the flag bit is set; determining whether notification that action has been performed by file handler is received; and allowing the file operation on the file when notification is received. Other embodiments of this aspect include corresponding systems, apparatus, and computer programs recorded on computer storage devices, each configured to perform the operations of the methods.

These and other embodiments can each optionally include one or more of the following features. Invoking the user space file handler includes holding the operation and setting a time for the file handler to perform the action without setting any resource locks. Expiration of the set time without a request for further time results in the kernel performing the file operation. The file operation is a read file operation and the file handler requests the file data from a network location. The file operation is a modify file operation and the file handler copies the file prior to performing the modify file operation being performed. The method further includes identifying i -node information for the file, where the i-node information includes one or more flag bits. The method further includes when the flag bit is set, accessing extended attribute information associated with the file to identify special file handling operation to be performed by file handler.

Particular embodiments of the subject matter described in this specification can be implemented so as to realize one or more of the following advantages. Techniques are provided for invoking user space handlers for kernel space events. As a result, file operations are also intercepted before they are performed. A hold on a request can be initiated while a user space handler is invoked without setting any locks. Thus, deadlocks can be avoided. Invoking user space handlers provides greater flexibility in performing operations than in the kernel space, for example, initiating network connections to retrieve data. Special file handling can be identified for files by examining i-node flags. Using an i-node flag to identify special file handling provides a quick check and reduces input/output (I/O) operations.

The details of one or more embodiments of the subject matter described in this specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of an example system.

FIG. 2 is an example diagram of kernel space interactions.

FIG. 3 is a flow diagram of an example process for performing file operations.

FIG. 4 is an example system architecture.

Like reference numbers and designations in the various drawings indicate like elements.

DETAILED DESCRIPTION

When a file operation request is received at the kernel, for example, as a system call to read a file or to modify a file (write), a determination is made as to whether additional operations are to be performed before allowing the requested file operation. If additional operations are to be performed, the kernel institutes a hold on the file operation, without reserving any resources to the request (e.g., no locks), and sets a timeout timer. An extended attribute of the file identifies special file handling to be performed on the file. A corresponding user space file handler is requested to perform one or more actions within the time provided by the timer. When the timer expires, the hold is released and the requested resources are allocated for performing the requested file operation.

FIG. 1 is a block diagram of an example system 100. The system 100 includes a user device 102 that can communicate with one or more network locations 106 through a network 104.

The user device 102 can represent various forms of digital computing devices, for example, laptops, desktops, workstations, personal digital assistants, tablet computers, and other mobile devices (e.g., smartphones, mobile phones). The network 104 can be, for example, a local area network (LAN), wide area network (WAN), the Internet, or a combination thereof. Communication through the network 104 can be performed using one or more wired, wireless, or a combination of connections. The network locations 106 can be other computing devices, remote storage locations, servers, or other data repositories accessible by the user device 102.

The user device 102 includes both a kernel space 110 and user space 108. In particular, an operating system typically segregates virtual memory into the kernel space 110 and the user space 108. The kernel space 110 is reserved for running the kernel, kernel extensions, and device drivers. The user space 108 includes the memory area where user mode applications work. The kernel represents a bridge between applications and the actual data processing done at the hardware level. The kernel's responsibilities include managing the system's resources. Particular operations can be performed in the user space that cannot be performed in the kernel space and vice versa. For example, network connections can be made in user space but not in kernel space.

In response to a request, for example, to perform a file modification or a file open, the kernel can invoke one or more file handlers in the user space 108 to perform particular actions prior to assigning resources in response to the request. The user space handler can perform operations including opening network connections in order to request data from one or more network locations 106 through the network 104.

FIG. 2 is an example diagram of kernel space interactions 200. A file operation request is received by the kernel 204 (e.g., from a user space application). Based on an analysis of the request, the kernel holds the request and invokes a user space handler 206 to perform one or more actions (e.g., according to a token sent to the user space file handler 206). In particular, the kernel can read an extended attribute 208 associated with the file to identify the particular file operations to be performed, e.g., prior to allowing the requested file operation to proceed. The kernel does not place a lock on any resources. The kernel further can set a timer in which the user space handler 206 has to perform the one or more actions or request additional time to perform the one or more actions.

The user space handler 206 performs the one or more actions and sends the token back to the kernel 204 indicating that the actions have been completed. Alternatively, the user space handler 206 can send a request for more time. The kernel 204 can increase or reset the timer in response to the request for more time. Once notified of the completed action, the kernel 204 releases the hold and assigns resources for performing the requests file operation.

No locks are placed by the kernel 204. Therefore, if the timer runs out without receiving notification or a request for time from the user space handler 206, the hold is released by the kernel 204 and resources are allocated for attempting to perform the requested file operation, after which control returns to the application (e.g., to perform further operations based on the requested file operation, for example, presenting file contents based on a requested read operation).

FIG. 3 is a flow diagram of an example process 300 for performing file operations. A request to perform a file operation is received (302). The request can be a system call to perform various types of file operations. For example, a read operation (e.g., to read a file) or a write operation (e.g., to edit file data). The received request can be sent to the kernel in response to user actions, for example, a user selecting a file to open or a user performing an edit on an open file.

File information associated with the request is identified (304). For example, in order to perform a read or write of a file, the file data is identified according to the file's i-node or other catalog record. This is a unique identifier for the file that is indexed (e.g., a table of i-nodes) to file information including the location of blocks of data in which the file is stored, time stamps, last accessed, owner of file, permissions, size, and/or creation time. The kernel can load the i-node file information into memory. The i-node can also be associated with one or more flag bits.

A determination is made as to whether a particular flag bit is set (306). This particular flag bit indicates that special file handling should be performed. However, as a single bit this does not indicate what processing is to be performed on the file. Instead, when the flag bit is set, the kernel reads the extended attribute associated with the file. The extended attribute is an additional data structure associated with the file that describes the special file handling to be performed. Since the i-node (or corresponding catalog record) is loaded into memory by the kernel, little processing cost is required to check the flag bits (e.g., no additional I/O operations are required). If the flag bit is not set, the kernel can proceed to respond to the request.

For example, the extended attribute can provide metadata about the type of file and what special handling is needed to be performed. For example, the extended attribute can indicate that the particular file is a data-less file (e.g., the file is only virtually present) and that the corresponding file data needs to be retrieved before the requested operation can be performed (e.g., the file data has to be retrieved before the file can be read). In another example, the extended attribute can indicate that the particular file is to be backed up before the requested operation is performed (e.g., to back-up a copy of the file prior to modifying the file). In some implementations, the extended attribute identifies a particular user space handler to be invoked (e.g., designed to perform particular operations).

A user space file handler is invoked to perform the one or more actions indicated by the extended attribute (308). For example, in some implementations, the file handler has already called into the kernel requesting work and is waiting to be assigned a token. Alternatively, the specific user space file handler is invoked by the kernel. The kernel sends the token to the file handler indicating the actions to be performed with respect to the file. The file handler then initiates a thread to perform the requested action. For example, the file handler can open one or more network connections to a destination in which the file data is stored. The file data can then be retrieved and associated with the existing zero data file.

In addition to invoking the user space file handler, the kernel holds the requested operation and sets a timer (310). The timer provides a time limit for the user space file handler to perform the indicated operation. While the timer holds the requested operation, no resources are locked out by the kernel. Thus, no deadlocks can occur from the request (e.g., from a resource request made by the user space file handler). As a result, other requests for resources can be made (e.g., as part of other processes) while the user space file handler is performing the one or more actions.

A determination is made as to whether the timer has expired before receiving a response from the user space file handler (312). If a response is not received, the hold is released (314) and the requested operation proceeds (322). For example, the kernel can allocate the necessary resources for performing the requested operation. The operation may or may not be able to be performed. For example, if the requested file operation is to read a file that is not present, then an error will result from the attempted read operation. However, if the requested file operation is a file modification to an open file, the modification can be performed.

If a response is received from the user space file handler (316) before the timer has expired, a determination is made as to whether or not the action or actions have been completed (318). If the action or actions have not been completed, a request for more time can be sent by the user space file handler. The timer is extended or reset (320). The process then returns to the determination of whether a response is received before the timer expires (312). The hold on the file operation is maintained until the timer again expires.

If the action has been completed, the hold is released and the requested file operation proceeds (322). The user space file handler can indicate that the action has been complete, for example, by returning the received token to the kernel or other notification. The file operation can proceed, for example, by the kernel assigning resources to perform the requested operation. For example, when the request is a read file request, the resources are allocated for reading the file into memory. The file data can then, for example, be presented to the user (e.g., as a document).

Specific examples are provided below to further illustrate the process 300.

In some implementations, data-less data items are created that present the appearance of data items that are not located on the particular user device. For example, items being synchronized or downloaded from other locations (e.g., other user devices or network locations) may not have the data to be synchronized or downloaded stored locally. Different criteria can be used to determine whether or not to download the data. For example, the size of the file, the current transfer rate, the type of data, etc. As such, the data can be stored elsewhere and retrieved when requested by the user. In order to present the appearance of the data, virtual data items can be generated. For example, files can appear to be present on the user device (e.g., as a displayed icon in a particular directory), but the file data is not physically present.

Similarly, in another implementation, data items can be removed from the user device and stored in a network location while retaining the virtual item representation. Again, different criteria can be used to determine whether or not to remove a data item from physically being located on the user device. For example, data items that are accessed infrequently can be removed to save space. Additionally, data items can be removed based on a percentage or amount of free space remaining on the device to store data (e.g., when a threshold storage level is reached).

These virtual data items can appear as any other file, directory, or other data on the file system or can be identified as virtual files (e.g., with a particular badging of an associated icon).

When a user requests one of these virtual data items, for example a file, the data must be retrieved before the file read is attempted otherwise an error will result. Consequently, the read request the flag bit indicates that the file is an empty file and that the file data must be retrieved. The read request is held by the kernel while a particular user space file handler is requested in order to retrieve the associated file data.

The user space file handler performs the network connections to the storage location of the file data (e.g., at a location indicated in the extended attribute data). The retrieved file data is then associated with the data-less data file such that the file now has the file contents (e.g., the data-less file can be modified or replaced). If the user space file handler fails to perform the actions, for example, because the network connections are unavailable, a time out will result in the hold being released by the kernel. However, since there is no data to be read, an error will result. This read error can be presented to the user indicating that the action is unable to be performed at the present time.

Once the file data is retrieved, the user space file handler indicates to the kernel that the actions have been performed and the kernel releases the hold on the read operation. The file is then read, for example, for presentation to the user by an application associated with the file.

In some other implementations, changes to data items are tracked and periodically stored prior to any changes being made. For example, a modification to a data item can trigger a copy of the data item (e.g., a version of the data item) being stored prior to performing the modification. The user device can maintain a collection of versions of the data item for later retrieval by the user.

Thus, in order to store copies of the file prior to modification, the file modification request has to be intercepted and the copy performed prior to allowing the modification operation to proceed.

When a request to perform a file modification operation is received by the kernel, the kernel identifies the flag bit as set. The extended attribute indicates that the file is one to be tracked and that a copy should be made before a file modification operation. In some implementations, this is variable depending on specific criteria. For example, only one copy can be made for a given time period (e.g., 1 hour). Thus, a copy is made of the file prior to the first modification of the file.

However, any subsequent modifications to the same file within that time period are allowed to proceed without copying the file. Thus, a file that changes 10 times within an hour will only be copied the first time after the start of the specified time period. Later requests to modify the file within the specified time period will not trigger the special file handling prior to performing the requested operation.

In some other implementations, when a request to perform a file modification operation is received, a determination is made as to whether a prior operation has been performed on the file within a specified time period. For example, the time of the requested modification can be compared to operations performed on the file within a current fixed window of time. For example, the specified time period can be one of a sequence of fixed time windows (e.g., every hour block of time can form a specified time period regardless of when a prior file modification operation occurred). If there has not been a prior operation on the file within a current time window (or less than a specified number of modifications), the file is copied prior to the modification operation being performed. In some implementations, in addition to the determination with respect to time, the type of requested modification is considered. For example, an edit operation can trigger the copy being made while a name change may not.

If there has been a prior operation on the file within the current time window, the requested modification can be allowed within performing any intervening action (e.g., copying the unmodified version of the file). For example, when only one action (e.g., copy file) per time window is performed prior to a requested modification operation on a file.

If the file is to be copied prior to the modification operation, the user space file handler is called to perform the file copy operation. When the kernel is notified that the operation is complete, the hold on the operation is removed and the file is allowed to be modified.

If a timeout occurs without a response from the user space file handler, the hold is released and the modification is allowed to occur. Thus, no pre-modified version will be stored.

In some other implementations, files can have a flag bit set when the file is to be tracked. For example, when versions of a file are stored as described above, the versions can be tracked to make sure all the versions are linked together. In another example, if a request for a name change is received for a tracked file the name change is held while performing a link between the new file and the earlier versions.

FIG. 4 illustrates an example architecture of a system 400. The system architecture 400 is capable of performing operations for performing file system management. The architecture 400 includes one or more processors 402 (e.g., IBM PowerPC, Intel Pentium 4, etc.), one or more display devices 404 (e.g., CRT, LCD), graphics processing units 406 (e.g., NVIDIA GeForce, etc.), a network interface 408 (e.g., Ethernet, FireWire, USB, etc.), input devices 410 (e.g., keyboard, mouse, etc.), and one or more computer-readable mediums 412. These components exchange communications and data using one or more buses 414 (e.g., EISA, PCI, PCI Express, etc.).

The term “computer-readable medium” refers to any medium that participates in providing instructions to a processor 402 for execution. The computer-readable medium 412 further includes an operating system 416 (e.g., Mac OS®, Windows®, Linux, etc.), a network communication module 418, user space file handlers 422, and other applications 424.

The operating system 416 can be multi-user, multiprocessing, multitasking, multithreading, real-time and the like. The operating system 416 performs basic tasks, including but not limited to: recognizing input from input devices 410; sending output to display devices 404; keeping track of files and directories on computer-readable mediums 412 (e.g., memory or a storage device); controlling peripheral devices (e.g., disk drives, printers, etc.); and managing traffic on the one or more buses 414. The operating system 416 also includes kernel functions. The network communications module 418 includes various components for establishing and maintaining network connections (e.g., software for implementing communication protocols, such as TCP/IP, HTTP, Ethernet, etc.).

The operating system 416, particularly the kernel, and the user space file handlers 422 provide various software components for performing the various functions for holding file operation requests at the kernel and invoking a user space handler to perform one or more actions before releasing or timing out the hold on the requested operation as described with respect to FIGS. 1-3.

Embodiments of the subject matter and the operations described in this specification can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions, encoded on computer storage medium for execution by, or to control the operation of, data processing apparatus. Alternatively or in addition, the program instructions can be encoded on an artificially-generated propagated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus. A computer storage medium can be, or be included in, a computer-readable storage device, a computer-readable storage substrate, a random or serial access memory array or device, or a combination of one or more of them. Moreover, while a computer storage medium is not a propagated signal, a computer storage medium can be a source or destination of computer program instructions encoded in an artificially-generated propagated signal. The computer storage medium can also be, or be included in, one or more separate physical components or media (e.g., multiple CDs, disks, or other storage devices).

The operations described in this specification can be implemented as operations performed by a data processing apparatus on data stored on one or more computer-readable storage devices or received from other sources.

The term “data processing apparatus” encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, a system on a chip, or multiple ones, or combinations, of the foregoing The apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit). The apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, a cross-platform runtime environment, a virtual machine, or a combination of one or more of them. The apparatus and execution environment can realize various different computing model infrastructures, such as web services, distributed computing and grid computing infrastructures.

A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, object, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform actions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for performing actions in accordance with instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device (e.g., a universal serial bus (USB) flash drive), to name just a few. Devices suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, embodiments of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.

Embodiments of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), an inter-network (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks).

The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In some embodiments, a server transmits data (e.g., an HTML page) to a client device (e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device). Data generated at the client device (e.g., a result of the user interaction) can be received from the client device at the server.

While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features specific to particular embodiments of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.

Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

Thus, particular embodiments of the subject matter have been described. Other embodiments are within the scope of the following claims. In some cases, the actions recited in the claims can be performed in a different order and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous. 

What is claimed is:
 1. A method comprising: by a kernel of a computing device: receiving a request to perform an operation on a file; reading a file attribute when a flag bit associated with the file indicates that special file handling should be performed; determining, based on the special file handling, that a file content needs to be downloaded before the operation can be performed; invoking by the kernel a user space file handler to perform a particular action, wherein the particular action comprises: i) downloading the file content and ii) modifying the file based on the file content to produce a modified file; holding the operation and setting a time for the file handler to perform the particular action without setting any resource locks; determining that the particular action has been performed by the file handler; and allowing the operation to be performed on the modified file.
 2. The method of claim 1, wherein expiration of the set time without a request for further time results in the kernel performing the operation.
 3. The method of claim 2, wherein the kernel executes in a kernel space of a memory of a computer system.
 4. The method of claim 3, wherein the kernel space is reserved for execution of the kernel, kernel extensions, and device drivers.
 5. The method of claim 3, wherein particular operations can be performed in a user space that cannot be performed in the kernel space and vice versa.
 6. The method of claim 1, wherein the operation is a read file operation.
 7. The method of claim 1, wherein the operation is a modify file operation and the file handler copies the file prior to the modify file operation being performed.
 8. The method of claim 1, further comprising: identifying i-node information for the file, wherein the i-node information includes one or more flag bits.
 9. The method of claim 8, further comprising: loading the file attribute into a memory when the file operation request is received; and when the flag bit is set: accessing the file attribute from the memory to identify the particular action to be performed by the file handler.
 10. The method of claim 9, wherein the file attribute identifies a particular user space handler to be invoked, the particular user space handler being designated to perform particular operations.
 11. The method of claim 1, wherein the file attribute indicates that the file is to be backed up before the operation is performed.
 12. The method of claim 2, wherein a user space comprises a memory area in which user mode applications execute and the kernel does not execute.
 13. A system comprising: one or more computing devices operable to perform operations comprising: receiving a request at a kernel to perform an operation on a file; reading a file attribute when a flag bit associated with the file indicates that special file handling should be performed; determining, based on the file attribute, that a file content needs to be downloaded before the operation can be performed; invoking by the kernel a user space file handler to perform a particular action, wherein the particular action comprises: i) downloading the file content and ii) modifying the file based on the file content to produce a modified file; holding the operation and setting a time for the file handler to perform the particular action without setting any resource locks; determining at the kernel that the particular action has been performed by the file handler; and allowing, at the kernel, the operation to be performed on the modified file.
 14. The system of claim 13, wherein expiration of the set time without a request for further time results in the kernel performing the operation.
 15. The system of claim 13, wherein the operation is a read file operation.
 16. The system of claim 13, wherein the operation is a modify file operation and the file handler copies the file prior to the modify file operation being performed.
 17. The system of claim 13, further operable to perform operations comprising: identifying i-node information for the file, wherein the i-node information includes one or more flag bits.
 18. The system of claim 17, further comprising: when the flag bit is set: loading the file attribute into a memory when the file operation request is received; and accessing the file attribute from the memory to identify the particular action to be performed by the file handler.
 19. The system of claim 13, wherein the file attribute indicates that the file is to be backed up before the operation is performed.
 20. The system of claim 13, wherein the file attribute identifies a particular user space handler to be invoked, the particular user space handler being designated to perform particular operations.
 21. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising: receiving a request at a kernel to perform an operation on a file; reading a file attribute when a flag bit associated with the file indicates that special file handling should be performed; determining, based on the special file handling, that a file content needs to be downloaded before the operation can be performed; invoking by the kernel a user space file handler to perform a particular action, wherein the particular action comprises: i) downloading the file content and ii) modifying the file based on the file content to produce a modified file; holding the operation and setting a time for the file handler to perform the particular action without setting any resource locks; determining that the particular action has been performed by the file handler; and allowing the operation to be performed on the modified file.
 22. The non-transitory computer storage medium of claim 21 , wherein expiration of the set time without a request for further time results in the kernel performing the operation.
 23. The non-transitory computer storage medium of claim 21, wherein the operation is a read file operation.
 24. The non-transitory computer storage medium of claim 21 wherein the operation is a modify file operation and the file handler copies the file prior to the modify file operation being performed.
 25. The non-transitory computer storage medium of claim 21, further operable to cause the data processing apparatus to perform operations comprising: identifying i-node information for the file, wherein the i-node information includes one or more flag bits.
 26. The non-transitory computer storage medium of claim 21, further operable to cause the data processing apparatus to perform operations comprising: loading the file attribute into a memory when the file operation request is received; and when the flag bit is set: accessing the file attribute from the memory to identify the particular action to be performed by the file handler.
 27. The non-transitory computer storage medium of claim 21, wherein the file attribute indicates that the file is to be backed up before the operation is performed.
 28. The non-transitory computer storage medium of claim 21, wherein the file attribute identifies a particular user space handler to be invoked, the particular user space handler being designated to perform particular operations.
 29. A method comprising: by a kernel of a computing device: receiving a request to perform an operation on a file; reading metadata associated with the file, the metadata indicating whether a prior operation has been performed on the file within a time period; invoking a user space file handler to perform an action in user space when no prior operation has been performed within the time period; holding the operation and setting a time for the file handler to perform the particular action without setting any resource locks; determining that the action has been performed by the file handler in user space; and allowing the operation to be performed on the file.
 30. The method of claim 29, wherein the request is a file modification and the particular action is to perform a copy of the file.
 31. The method of claim 29, wherein the time period is a first time window of a sequence of time windows, and wherein the action is performed on the file once during each time window of the sequence of time windows.
 32. The method of claim 29, wherein the first time window is an hour block of time. 